Multi-Agent Imitation Learning for Driving Simulation

Authors

Raunak P. Bhattacharyya

Derek J. Phillips

Blake Wulfe

Jeremy Morton

Alex Kuefler

Mykel J. Kochenderfer

Abstract

Simulation is an appealing option for validating the safety of autonomous vehicles. Generative Adversarial Imitation Learning (GAIL) has recently been shown to learn representative human driver models. These human driver models were learned through training in single-agent environments, but they have difficulty in generalizing to multi-agent driving scenarios. We argue these difficulties arise because observations at training and test time are sampled from different distributions. This difference makes such models unsuitable for the simulation of driving scenes, where multiple agents must interact realistically over long time horizons. We extend GAIL to address these shortcomings through a parameter-sharing approach grounded in curriculum learning. Compared with single-agent GAIL policies, policies generated by our PS-GAIL method prove superior at interacting stably in a multi-agent setting and capturing the emergent behavior of human drivers.

Similar resources

Embodied imitation-enhanced reinforcement learning in multi-agent systems

Imitation is an example of social learning in which an individual observes and copies another’s actions. This paper presents a new method for using imitation as a way of enhancing the learning speed of individual agents that employ a well-known reinforcement learning algorithm, namely Q-learning. Compared to other research that uses imitation with reinforcement learning, our method uses imitati...

Burn-In Demonstrations for Multi-Modal Imitation Learning

Recent work on imitation learning has generated policies that reproduce expert behavior from multi-modal data. However, past approaches have focused only on recreating a small number of distinct, expert maneuvers, or have relied on supervised learning techniques that produce unstable policies. This work extends InfoGAIL, an algorithm for multi-modal imitation learning, to reproduce behavior ove...

Learning Imitation Strategies Using Cost-based Policy Mapping and Task Rewards

Learning by imitation represents a powerful approach for efficient learning and low-overhead programming. An important part of the imitation process is the mapping of observations to an executable control strategy. This is particularly important if the capabilities of the imitating and the demonstrating agent differ significantly. This paper presents an approach that addresses this problem by o...

Cost-Based Policy Mapping for Imitation

Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This is particularly important if the behavioral capabilities of the observed and imitating agent differ significantly. This paper presents an approach that addresses this problem by loc...

Adaptive Cost-Based Policy Mapping for Imitation

Srichandan Venkat Gudla, M.S., The University of Texas at Arlington, 2003. Supervising Professor: Manfred Huber.

Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This...

Publication date: 2018
